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This paper discusses the lexicographical concept of 



lexical functions ([Mcl'cuk and Zolkovsky, 1984 1 ) 
and their potential exploitation in the develop- 
ment of a machine translation lexicon designed 
to handle collocations. We show how lexi- 
cal functions can be thought to reflect cross- 
linguistic meaning concepts for collocational struc- 
tures and their translational equivalents, and 
therefore suggest themselves as some kind of 
language-independent semantic primitives from 
which translation strategies can be developed^] 

1 Description of the Problem 

Collocations present specific problems in transla- 
tion, both in human and automatic contexts. If we 
take the construction heavy smoker in English and 
attempt to translate it into French and German, 
we find that a literal translation of heavy yields the 
wrong result, since the concept expressed by the 
adjective (something like 'excessive') is trans- 
lated by grand (large) in French and stark (strong) 
in German. We observe then that in some sense 
the adjectives stark, grand and heavy are equiv- 
alent in the collocational context, but that this 
is of course not typically the case in other con- 
texts, cf grande boite, starke Schachtel and heavy 
box, where the adjectives could hardly be viewed as 
equivalent. It seems then that adjectives which are 
not literal translations of one another may share 
meaning properties specifically in the collocational 
context. 

How then can we specify this special equivalence 
in the machine translation dictionary? The answer 
seems to lie in addressing the concept which un- 
derlies the union of adjective and noun in these 
three cases, i.e., intensification, and hence estab- 
lish a single meaning representation for the adjec- 
tives which can be viewed as an interlingual pivot 
for translation. 

Collocations 

have been studied by computational linguists in 



lr The research reported in this paper was undertaken as 
the project "Collocations and the Lexicalisation of Seman- 
tic Operations" (ET-10/75). Financial contributions were 
by the Commission of the European Community, Associa- 
tion Suissetra (Geneva) and Oxford University Press. 



different contexts. For instance, there is a sub- 
stantial body of papers on the extraction of "fre- 
quently co-occurring words" from corpora using 
statistical methods (e.g., ([Choueka et at, 1983 1 ) , 



( ]Church and Hanks, 1989| ), ( ]Smadja, 1993[ ) to 
list only a few). These authors focus on tech- 
niques for providing material that can be used 
in other processing tasks such as word sense dis- 
ambiguation, information retrieval, natural lan- 
guage generation and so on. Also, the use 
of collocations in different applications has been 



discussed by various authors (([McRoy, 1992 1 ) , 
( jPustejovsky et al, 1992|), 

([ Bmadja and McKeown, 199(| ) etc.). However, 
collocations are not only considered useful, but 
also a problem b oth in certain appli cations 
(e.g. generation, (| Nircnburg et al, 1988| ), ma- 



chine translation, ([ Hcid and Raab, 1989| ) ) and 
from a more theoretical 

point of view (e.g. (fAbcillc and Schabcs, 19891 ), 



(| Krcnn and Erbach, To appear |)). 

We have been concerned with investigat- 
ing the lexical functions (LFs) of MePcuk 
( flVlel'cuk and Zolkovsky, 1984| ) as a candidate in- 
terlingual device for the translation of adjectival 
and verbal collocates. Our work is related to re- 
search by (| Hcid and Raab, 1989| ). In some re- 
spects it is an extension of some of their sugges- 
tions. Our work differs from theirs in scope and 
also in the exploration of various other directions. 



2 Representation 

The use we make of lexical functions as interlin- 
gual representations, does not respect their orig- 
inal MePcukian interpretation. Furthermore, we 
have transferred them from their context in the 
Meaning- Text Theory to a different theoretical set- 
ting. We have embedded the concept in an HPSG- 
like grammar theory]^] In this section we review 
this operation. First we consider the features of 
Mel'cuk's treatment that we have wanted to pre- 
serve. Next we show how they have been imported 
into the HPSG framework. 



2 Head Driven Phrase 



Structure grammar. 



([ Pollard and Sag, 1987 ]), ( Pollard and Sag, to appear ]). 
For another treatment of collocations in HPSG, see 



([Krenn and Erbach, To appear]) 



2.1 Collocations and LFs 



• Coding the base-collocate relation in the lex- 



In Mel'cuk's Explanatory Combinatory Dictio- 



nary (ECD, see ( fMel'cuk et al, 198J )), expres- 
sions such as une ferme intention, une resistance 
acharnee , un argument de poids, un bruit infernal 
and donner une lecon, faire un pas, commetre un 
crime are described in the lexical combinatorics 
zone. These "expressions plus ou moins figees" 
will be called 'collocations'. They are considered 
to consist of two parts — the base and the collo- 
cate. In the examples above, the nouns are the 
bases and the adjectives and the verbs are the col- 
locates. The idea that all adjective collocates and 
all the verb collocates share an important meaning 
component — roughly paraphrasable as intense 
and do respectively — and the fact that the ad- 
jectives and verbs are not interchangeable but are 
restricted with this meaning to the accompany- 
ing nouns, is coded in the dictionary using lexical 
functions (in this case Magn and Oper). 

Each article in the ECD describes what is called 
a 'lexeme': a word in some specific reading. In 
the lexical combinatorics zone, we find a list of 
the lexical functions that are relevant to this par- 
ticular lexeme. Each lexical function is followed 
by one or more lexemes (the result or value of the 
function applied to the head word). The idea is 
that each combination of the argument with one 
of the values of the function forms a collocation in 
our terminology. The argument corresponds to the 
base and each value is a collocate. The following 
features of this representation are important to us. 

• Lexical functions are used to represent an im- 
portant syntactico-semantic relation between 
the base and the collocate. 

• The restricted combinatorial potential of the 
collocate lexeme is accounted for by listing it 
at each base with which it can occur. 

The second of these characteristics points out 
that the collocational restriction is seen as a purely 
lexical, idiosyncratic one: all collocations are ex- 
plicitly listed. 

One other aspect of collocations which we have 
to deal with is the relation between the collocate 
lexeme and its freely occurring counterpart. Col- 
locate lexemes often differ in some respects from 
their literal variants while sharing other proper- 
ties. Mel'cuk deals with this by including in the 
ECD an entry for the free variant and putting the 
collocate-specific information in the entry for the 
base (with the result of the lexical functions). The 
full entry of the collocate is the result of taking the 
entry for the free variant and overwriting it with 
the information provided at the base. 

2.2 Collocations in HPSG 

The three aspects of Mel'cuk's analysis we wanted 
to encode in HPSG were the following. 



• Choosing the level at which lexical functions 
will be situated. 

• Relating the collocate information to the free 
variant entry. 

We have provided straightforward solutions to 
these problems. For the first problem we have 
taken over the ECD architecture rather directly, 
by creating a dedicated 'collocates' field in the en- 
try for bases which contains all the relevant collo- 
cates. As far as the second problem is concerned, 
the obvious place to put lexical functions is in the 
semantic representation provided by HPSG. There 
are various reasons for this. One is that LFs are 
used in the deep syntax level in Mel'cuk's model, 
a level oriented towards meaning. Another rea- 
son is that this level seems most appropriate to be 
used in transfer/translation and because we want 
to use lexical functions in transfer, this is where 
they should be. In contrast to the ECD, the mean- 
ing of the collocate is represented by the lexical 
function only. 

The following is an example of the entry for crit- 
icism with the encoding of strong as a collocate.^ 
We use SEM_ind as an abbreviation for the feature 

path SEM.CONT.IND. 

PHON criticism 

VAR \T\ 

REST {criticism (|T|)} 
Sstrong 

VAR \T\ 



SEM.IND 



COLLS { 



SEM.IND 



REST {MagnQ} 



Just as in the ECD the base contains a specific 
zone in which the collocates are listed. In our case, 
the feature 'COLLS' has a set of lexical entries as 
its value. 

Each collocate subentry bears the value of the 
lexical function in its semantics field. In this rep- 
resentation the lexical function is chosen as the 
real semantic value of the collocate. One should 
read the feature structure as specifying that the se- 
mantics of strong (as a collocate) is the predicate 
Magn©. 

The collocate subentry only provides partial in- 
formation. In fact, it provides only the information 
that is specific to the occurrence of strong in its 
combination with criticism. In this case only the 
semantics is given. We further assume that the lex- 
icon also contains a 'super-entry' which provides 
all the information that is shared by all the differ- 
ent occurrences of strong. This entry is where the 
variable Sstrong points to. Of course, other archi- 
tectures that try to avoid redundant specification 
of information are equally possible. For instance 
if one assumes a mechanism of default unification. 



^ Nntipp that iiprp wp usq a simple version of HPSG based 
on ([ Pollard and Sag, 1987|) whereas the actual im plemen- 
tation was based on ([ [pollard and Sag, to appcar| ) . 



one can have Sstrong refer to the full entry de- 
scribing 'strong' in say its ordinary use, and have 
the values that are particular to the collocational 
strong overwrite the values provided in the ordi- 
nary entry, as in Mel'cuk's proposal. 

Collocations, Rules and Principles So far, 
we have not specified in what way one gets from 
the lexical entries for the base and the collocate to 
the representation of the collocational expression. 

In HPSG, the descriptions of complex expres- 
sions are constrained by principles. We will as- 
sume that collocations are subject to the same con- 
straints. The ordinary rules of combination (com- 
bining adjectives and nouns, for instance) thus ac- 
count for most of the properties of the colloca- 
tional combination. However, we are still left with 
the typical 'collocational restriction' which needs 
to be accounted for. 

We have therefore added a principle which says 
that constructions that are analysed as colloca- 
tions (indicated by the type COLLOCATION) are ei- 
ther head-adjunct structure or head-complement 
structures with specific restrictions holding be- 
tween the head and the adjunct or the head and 
the complement respectively. Let's consider the 
former case^], illustrated by the heavy smoker ex- 
ample. The adjunct daughter will contain the ad- 
jective collocate. In such collocational construc- 
tions the collocate adjuncts have to be 'licensed' 
by the noun or the head daughter. This is im- 
plemented by requiring that the collocates field 
(colls) of the head daughter contains a reference 
to a lexical entry that is compatible with the ad- 
junct daughter. In the literal reading of an expres- 
sion such as heavy smoker, the phrase will not be 
analysed as a COLLOCATION and the principle does 
not apply. 



[ ] COLLOCATION => 

" HEAD_DTR [ COLLS {...[!]...} ] 
ADJ.DTRS < -^COLLOCATE- > 

HEADJDTR 0COLLOCATE 
COMP.DTRS < ... COLLS.. ..[T]...- ... > 



3 Issues in Translation 

The project has tried to investigate the use of lex- 
ical functions as an interlingual device, i.e., one 
which is shared by the semantic representations of 
collocations in the language pairs^. 



4 To illustrate the case of head-complement structures 
one could take some support verb construction (also called 
light verb construction). 

5 For anot her application of LFs in a multilingual NLP 
context see ([ Heid and Raab, 1989[ ). For other treatments 

of cnlloc:i- 

tions in language generation see Q Nfirenburg et al., 1988 ) 
and ( fcmadja and McKeown, 199C |). 



The typing of a collocation with such a func- 
tion opens up the way to a treatment of colloca- 
tions inside a given language module and hence to 
a substantial reduction in the number of colloca- 
tions explicitly handled in the multilingual transfer 
dictionary. The existence of a collocation function 
is established during analysis. This information 
is used to generate the correct translation in the 
target language. To illustrate, the English analy- 
sis module might analyse (1) as (2). The transfer 
module maps (2) onto (3) which is then synthc- 
sised by the French module to (4). 

(1) heavy smoker — » (2) Magn(smoker) 
— * (3) Magn(fumeur) — > (4) grand 
fumcur 

The example points out that the translation 
strategy is a mixture of transfer and interlingua. 
The bases are transferred but the representation 
of the collocate is shared between the source and 
the target representation. This treatment of collo- 
cations rests, among others, on the assumptions 
that there are only a limited number of lexical 
functions, that lexical functions can be assigned 
consistently, that all (or a significant number of) 
collocations realise a lexical function, that lexi- 
cal functions are not restricted to particular lan- 
guages, etc. In the following paragraph we present 
an outline of the translation process. Next, we dis- 
cuss some of the problems which follow from our 
approach and we propose some ways to solve them. 



3.1 Lexical Functions as Interlingua 

It was assumed that the starting point for trans- 
fer is the semantic representation of the phrase. 
Using a semantic representation as input to trans- 
fer implies that we relate semantic values of words 
and phrases. For our purposes this is very sat- 
isfying since we will now be using the semantics 
of collocates instead of their orthography, in other 
words: we use lexical functions and abstract away 
from the particular realisation of a collocate in a 
particular language. 

We now state the relation between the seman- 
tic representations of the source language and tar- 
get language. The semantic relation between the 
phrase heavy smoker and its French counterpart 
can be made explicit in the following bilingual sign: 

VAR \T\ 

REST {smokcr(|T|),Magn(|T|)} 



EN— SEM.IND 



FR — SEMJND 



VAR \T\ 

REST {fumeur(|T|),Magn(|T|)} 



Typically, the lexicon will contain a bilingual 
sign for each possible value of RELN. Thus, for 
translating heavy smoker into grand fumeur we will 
need the obvious entry for smoker-fumeur plus the 
entry below: 



EN— SEMJND 



VAR [7] 

REST {Magn ([!])} 



FR— SEMJND 



b y some (e.g., ([Anick and Pustcjovsky, 1990| ), 



VAR \T\ 
REST {Magn([T])} 

The interlingual status of the lexical function is 
self-evident. Any occurrence of Magn will be left 
intact during transfer and it will be the generation 
component that ultimately assigns a monolingual 
lexical entry to the LF.0 

3.2 Problems 

Lexical Functions abstract away from certain nu- 
ances in meaning and from different syntactic re- 
alizations. We discuss some of the problems raised 
by this abstraction in this section. 

Overgenerality An important problem stems 
from the interpretation of LFs implied by their 
use as an interlingua — namely that the mean- 
ing of the collocate in some ways reduces to the 
meaning implied by the lexical function. This in- 
terpretation is trouble-free if we assume that LFs 
always deliver unique values; unfortunately cases 
to the contrary can be readily observed. An exam- 
ple attested from our corpus was the range of ad- 
verbial constructions possible with the verbal head 
oppose: adamantly, bitterly, consistently, stead- 
fastly, strongly, vehemently, vigorously, deeply, 
resolutely, etc. The function Magn is an appropri- 
ate descriptor in all cases since each adverb func- 
tions as a typical intensificr in this context. How- 
ever each adverb also denotes some other mean- 
ing aspect (s). The imprecision of LFs will mean 
that we have no means of distinguishing between 
the various intensifiers possible in the context of 
a given keyword, and hence will not have suffi- 
cient information to choose the most appropriate 
translation where, correspondingly, multiple possi- 
bilities exist in the target language. An important 
question here is how dramatic this loss of transla- 
tion quality really is. 

It is essentially in addressing the issue of over- 
generality that Mel'cuk introduces sub- and super- 
scripts to lexical functions, enhancing their pre- 
cision and making them sensitive to meaning as- 
pects of the lexical items over which they op- 
erate. Superscripts are intended to make the 
meaning of the LF more precise and hence more 
likely to imply unary mappings between argu- 
ments and values, subscripts are used to reference 
a particular semantic component of a keyword. 
The introduction of such devices into the account 
of LFs demonstrates both the need for precision 
and the fact that it does seem necessary to ad- 
dress semantic aspects of lexemes standing in co- 
occurrence relations. In fact it has been asserted 



6 For more details we refer the reader to ([Hcylcn, 1993]). 
There we also discuss our implementation in Alep, the 
C.E.C.'s unification-based grammar writing environment. 



(| Hcid and Raab, 1989 1 )) that collocational sys- 
tems are systematically predictable from the lex- 
ical semantics of nouns. In an attempt to ex- 
plore this notion further, we have investigated 
the approach to nominal semantics known as 
Qualia structure (| Pustejovsky, 199l| ) and consid- 
ered how this may complement the LF notion 
to improve its descriptive power]]. Among the 
promising avenues that occur to us are, firstly, 
the postulation of LF subscripts based on the four 
Qualia roles (assuming that these are the lexically 
most relevant aspects of noun semantics) and, sec- 
ondly, the application of LFs to semantic (Qualia) 
structures rather than monolithic lexemes; eg: the 
LF Bon is used in delivering evaluative qualifiers 
which arc standard expressions of praise or ap- 
proval. One could imagine application of the func- 
tion over the Constitutive and Agentive roles of 
the noun lecture, to deliver: 

Bon(Const : lecture) = informative 
Bon (Agent : lecture) = clear 

In both cases the idea is that the precision of 
the lexical function is essentially enhanced by ap- 
pealing to the semantic facets of its argument. 

Syntactic Divergences Another issue that has 
to be raised concerns the translation of colloca- 
tions into non-collocational constructions. If we 
are to maintain a consistent interlingual approach 
to the translation of these cases, we must extend 
our LF-based approach accordingly. We consider 
one case briefly. 

Cross-linguistic analysis reveals many cases 
where nominal-based collocational constructs are 
realised as compounds in Germanic languages, 
e.g., bunch of keys =>■ sleutelbos. A pos- 
sible account of such phenomena may be 
developed from the concept of merged LFs 
( fvIePcuk and Zolkovsky, 1970| ). Merged LFs are 
intended to be used in cases where a value lex- 
eme exists which appears to effectively reduce 
("merge") an LF meaning and its specified argu- 
ment to a single lexicalised form, rather than pro- 
jecting a syntagmatic unit. We could argue that 
in cases of compound formation, exactly the same 
process is to be accounted for, since the compound 
embodies both the concept mediated by the LF 
and its argument lexeme. We could therefore al- 
low compounds to be delivered as values of merged 
LF's, eg: //Mult(sleutel)= sleutelbos. 

These observations are useful in the MT context 
if we assume that we can effect a mapping between 
merged and unmerged LFs and therefore capture 
the correspondence between distinct structural re- 
alisations of the same concept. One way to emu- 
late such a mapping might be through the use of 



7 For a comparison bet ween aspects of Qua lia structures 
and lexical functions sec ([Hcylcn, to appear]). 



Mel'cuk's lexical paraphrasing rules. For instance, 
one could conceive of a lexical paraphrasing rule 
as follows^ W + Mult(W) //Mult(W). 

If we assume that in our monolingual English 
lexicon, we assign the collocate bunch as the Mult 
value of keyword key, and that accordingly in 
the Dutch lexical entry for sleutel we instantiate 
sleutelbos as the value of the merged LF / /Mult, 
then we can use the paraphrasing rule to effect a 
mapping between the two LF's and hence arrive at 
an interlingual approach to the translation of the 
example, despite structural mismatches, i.e., 

key + bunch[Mult(fcey)] 

sleutelbos[//Mult(sZewteO] 

Further examples exist where productive mor- 
phological processes (e.g., affixation^) lead to the 
lexicalisation in one language of concepts that ex- 
ist as syntagmatic constructs in another. Again, 
we suggest the use of merged LFs and correspond- 
ing mappings via lexical paraphrasing rules as a 
possible translation strategy in these cases. 

4 Summary and Conclusions 

In this paper we have discussed how the lexico- 
graphical concept of lexical functions, introduced 
by Mel'cuk to describe collocations, can be used 
as an interlingual device in the machine transla- 
tion of such structures. We have shown how the 
essentials of the ECD analysis can be embedded 
in the lexicon and grammar of a unification based 
theory of language. 

Our use of lexical functions as an interlingua as- 
sumes that the relevant aspects of the meaning of 
the collocate are fully captured by the LF. The LF 
therefore determines the accuracy of translations, 
which may be impoverished due to the generalised 
nature of basic LFs. We have suggested some ways 
in which LFs can be enriched with lexical semantic 
information to improve translation quality. 

The interlingua level reflects what is semanti- 
cally common to expressions which form transla- 
tional equivalents. It abstracts away from specific 
syntactic realisations. Given that collocations may 
translate as non-collocations, we also have to pro- 
vide a way to represent these expressions using lex- 
ical functions. We have provided an illustration on 
how to proceed in one such case. 

8 This is our own initiative - it seems to be the case 
as we examine the literature that neither LFs such as 
Magn, Bon etc (i.e., those representing standard quali- 
fiers/attributes) nor indeed merged LFs feature in lexical 
paraphrasing rules. We would argue that cross-linguistic 
analysis suggests that they should enter this domain; com- 
pound formation and other types of lexicalisation appear to 
be regular patterns of translation across many collocational 
constructs, as we illustrate here. 

9 One could think of an example such as mis-interpret. 
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